FireCite: Lightweight real-time reference string extraction from webpages
نویسندگان
چکیده
We present FireCite, a Mozilla Firefox browser extension that helps scholars assess and manage scholarly references on the web by automatically detecting and parsing such reference strings in real-time. FireCite has two main components: 1) a reference string recognizer that has a high recall of 96%, and 2) a reference string parser that can process HTML web pages with an overall F1 of .878 and plaintext reference strings with an overall F1 of .97. In our preliminary evaluation, we presented our FireCite prototype to four academics in separate unstructured interviews. Their positive feedback gives evidence to the desirability of FireCite’s citation management capabilities.
منابع مشابه
A Novel Architecture for Detecting Phishing Webpages using Cost-based Feature Selection
Phishing is one of the luring techniques used to exploit personal information. A phishing webpage detection system (PWDS) extracts features to determine whether it is a phishing webpage or not. Selecting appropriate features improves the performance of PWDS. Performance criteria are detection accuracy and system response time. The major time consumed by PWDS arises from feature extraction that ...
متن کاملWikipedia Verification Check: A Chrome Browser Extension
In this paper we present Version 1.0 of an implementation of a Wiki reference parser with a light-weight plugin in the form of a Google Chrome [Google 2017] Extension with Javascript. The output of the parser is a “verification score” for any Wikipedia page, constructed from a combination of scores derived from reference accessibility and quality. The extension presented herein works from a pre...
متن کاملValidation of Reference Genes for Real Time PCR Normalization in Milk Somatic Cells of Holstein Dairy Cattle
Real time-qPCR is the most reliable method for evaluation of mRNA expression levels. However, to obtain accurate results, selection of suitable reference genes is necessary for normalizing the real-time qPCR data. The aim of this research was to validate the expression stability of three potential reference genes (ACTB, GAPDH and UXT) in milk somatic cells of Holstein dairy cattle under differe...
متن کاملAutomatic Detection of Webpages that Share the Same Web Template
Template extraction is the process of isolating the template of a given webpage. It is widely used in several disciplines, including webpages development, content extraction, block detection, and webpages indexing. One of the main goals of template extraction is identifying a set of webpages with the same template without having to load and analyze too many webpages prior to identifying the tem...
متن کاملA PRACTICAL APPROACH TO REAL-TIME DYNAMIC BACKGROUND GENERATION BASED ON A TEMPORAL MEDIAN FILTER
In many computer vision applications, segmenting and extraction of moving objects in video sequences is an essential task. Background subtraction, by which each input image is subtracted from the reference image, has often been used for this purpose. In this paper, we offer a novel background-subtraction technique for real-time dynamic background generation using color images that are taken fro...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2009